Molecular Microbiology 1201 1) 80(6), H17-U19 ■ 



doi:10.1 1 1 1/j. 1365-2958. 201 1 .07699.x 



MicroCommentary 
Cutting out the c|>C31 prophage 



W. Marshall Stark* 

Institute of Molecular, Cell and Systems Biology, 
University of Glasgow, Glasgow G12 8QQ, UK. 



Summary 

To establish a lysogenic lifestyle, the temperate bac- 
teriophage <))C31 integrates its genome into the chro- 
mosome of its Streptomyces host, by site-specific 
recombination between attP (the attachment site in 
the phage DNA) and attB (the chromosomal attach- 
ment site). This reaction is promoted by a phage- 
encoded serine recombinase Int. To return to the lytic 
lifestyle, the prophage excises its DNA by a similar 
Int-mediated reaction between the recombinant sites 
flanking the prophage, attL and attR. <{>C31 Int has 
been developed into a popular experimental tool for 
integration of transgenic DNA into the genomes of 
eukaryotic organisms. However, until now it has not 
been possible to use Int to promote the reverse reac- 
tion, excision. In many other phages, the presence of 
a recombination directionality factor (RDF) protein 
biases the phage-encoded integrase towards proph- 
age excision, whereas absence of the RDF favours 
integration; but the <{>C31 RDF had proved elusive. In 
this issue of Molecular Microbiology, Khaleel era/. 
(2011) report the identification and purification of the 
<|)C31 RDF, and show that it both promotes excision 
and inhibits integration by direct protein-protein inter- 
actions with Int itself. 

Virulent bacteriophages destroy their host cell as they 
replicate and release new phage particles into the sur- 
rounding medium, but many 'temperate' phages can also 
adopt a lysogenic lifestyle where they hide in the host cell 
and get passed on from generation to generation. The 
iconoclastic discovery that phage X integrates its DNA into 
the host genome to establish lysogeny was followed by a 
surge of groundbreaking research on the integration 
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mechanism. It was revealed that the phage uses a site- 
specific recombinase enzyme (Int) to paste its circular 
DNA genome into the chromosome of the bacterial cell, by 
breaking and rejoining the strands at two sites called attP 
(on the phage) and attB (on the chromosome). The 
so-called prophage can remain quietly embedded in the 
host cell's DNA for thousands of generations, but eventu- 
ally, to re-enter the lytic lifestyle and produce more infec- 
tious particles, the phage genome must be cut out and 
recircularized. Both excision and integration are promoted 
by the same enzyme, Int, but how does it know which way 
to go? Phage X controls directionality with a protein called 
Xis. In the absence of Xis, Int promotes the attBx attP 
integration reaction, but has no activity on the recombi- 
nant attL and attR sites that flank the prophage DNA. 
However, in the presence of Xis the reverse is true; Int 
promotes attL x attR, but not attPxattB recombination. 
This switch is brought about by binding of Xis to the 
phage-encoded recombination site DNA; binding to attP 
inhibits integration, whereas binding to attR strongly 
stimulates excision. It is thought that Xis modulates Int 
activity by distorting the path of the DNA within the elabo- 
rate protein-DNA complexes formed when two att sites 
come together (Azaro and Landy, 2002). 

X Int was the first site-specific recombinase to be 
characterized. We now know of many hundreds of related 
enzymes, which form a group known as the tyrosine 
recombinases - so-called because a conserved tyrosine 
residue is the active site nucleophile that breaks the phos- 
phodiester of the DNA backbone, becoming covalently 
attached to the DNA in the process. Many other phages 
can adopt a lysogenic lifestyle like X, and for a long time all 
known phage integrases were tyrosine recombinases. 
However, in the 1990s it was discovered that some tem- 
perate phages, including <|>C31, use integrases belonging 
to a very different family of enzymes, the serine recombi- 
nases (Smith and Thorpe, 2002; Smith era/., 2010). 
Serine recombinases were already known as a close-knit 
family of small proteins (~ 200 amino acids) that promote 
programmed genetic rearrangements such as resolution 
of transposition intermediates, plasmid monomerization, 
and inversion of the orientation of genomic DNA 
segments. They use a serine residue rather than a 
tyrosine to break the DNA strands, and are structurally 
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and mechanistically unrelated to the tyrosine recombi- 
nases (Grindley etal., 2006). The newly identified phage 
integrases had an N-terminal domain that could be 
aligned with the known serine recombinases, but they 
were much larger proteins, typically 500-800 amino 
acids. The functions of the large C-terminal extensions 
were unknown at first. As <t)C31 Int and related systems 
were characterized, initially in vivo and then in vitro 
(Thorpe and Smith, 1998; Thorpe era/., 2000), a curious 
difference from the X-like systems became apparent; 
whereas X Int promotes recombination at a large, complex 
phage affP site (~ 240 bp), both the affPand attB sites for 
serine integrases are short (30-40 bp), with imperfect 
dyad symmetry suggesting that each site might just bind a 
dimer of Int. Furthermore, the in vitro studies showed that 
4>C31 Int is the only protein that is required for affPx attB 
recombination, whereas X Int requires an additional 
protein IHF. It was not at all clear how serine integrases 
could control directionality with such simple recombina- 
tion sites (Smith and Thorpe, 2002). And yet, <t)C31 Int did 
not promote recombination between attL and afffi in vitro. 
The experiments implied that an excision-specific factor 
was missing. 

The simplicity of the recombination sites used by 
4>C31 Int and its capacity for one-way integration imme- 
diately attracted the attention of those working in the 
nascent field of genome engineering. As a result, cf)C31 
Int has become a recombinase of choice for efficient 
integration of transgenic DNA into specific genomic loci 
of model organisms (Calos, 2006). The opportunities for 
application of this system might be expanded consider- 
ably if it could promote unidirectional excision as well as 
integration. 

The first 'RDF' (recombination directionality factor, as 
proteins with Xis-like function have become known) of a 
serine integrase was identified in 1999 in phage TP901-1 
(Breuner era/., 1999). Two other serine integrase RDFs 
followed, and it became apparent that phages can use a 
variety of unrelated proteins for this purpose. The discov- 
ery of proteins related to these RDFs in phages that do not 
have serine integrases has led to the suggestion that at 
least some RDFs have alternative functions, and have 
been co-opted by the integrase system to regulate direc- 
tionality. Prior to the work of Khaleel etal. (2011), the best 
understood serine integrase RDF was that of the Mycobac- 
terium phage Bxb1 (gp47). gp47 was purified and shown to 
interact directly with Bxb1 Int in vitro, forming a stable 
complex when Int is itself bound to an att site (Ghosh ef a/., 
2006; 2008), unlike Xis from phage X, which can bind to aff 
sites independently of Int. Meanwhile however, the 4>C31 
RDF was proving hard to catch; no proteins in the cf>C31 
genome resembled any of the previously identified RDFs. 

Khaleel ef a/, guessed that the <|)C31 RDF might interact 
directly with Int, like the Bxbl RDF. They therefore set out 



to test a set of 28 <])C31 -encoded proteins to see if any of 
them bound to Int in an £. co//'-based 'two-hybrid' assay. 
Only two proteins (other than Int itself) were recovered 
from this screen, and of these one, a 27.5 kDa protein 
called gp3, turned out to be the elusive RDF. The two- 
hybrid assays also showed that gp3 subunits interact, 
suggesting that it might be functionally multimeric. 
Recombination assays showed that gp3 dramatically 
stimulated Int-mediated attL x attR recombination and 
inhibited afrPx attB recombination. Intriguingly, gp3 also 
stimulated recombination between two attL sites or two 
attR sites but not other combinations of aff sites, lending 
support to the proposal (Ghosh etal., 2008) of a 'rule' that 
in a productive pairing of sites, each Int subunit bound to 
an affP-derived half-site must pair with an Int bound to an 
affS-derived half-site, and vice versa. 

To gain more detailed insight into the effects of gp3, the 
authors purified the protein. Using a 'pulldown' assay with 
gp3 immobilized on beads, they showed that it bound 
specifically to Int, even in the absence of any aff site DNA. 
They then studied the complexes formed by gp3 together 
with Int and the aff sites using electrophoretic mobility shift 
assays (EMSA). These experiments revealed a number of 
important features. First, there was no interaction of gp3 
with the aff site DNA in the absence of Int, suggesting that 
unlike X Xis, gp3 may act entirely by protein-protein inter- 
actions. Int on its own forms monomer and dimer com- 
plexes with all the aff sites, and can also be observed to 
make a synapse between attP and attB, but not between 
attL and attR. When gp3 and Int are both present, new, 
lower-mobility complexes of single aff sites are observed, 
presumed to contain subunits of both proteins. But most 
importantly, a synapse between attL and afffi can now be 
observed, while formation of the attPxattB synapse is 
inhibited. Thus, gp3 acts at the synapsis stage of recom- 
bination, to favour the excision synapse and disfavour the 
integration synapse. In accord with the results of the 
recombination assays in E. coli, synapsis of two attL sites 
or two attR sites could also be detected, but no synapses 
were seen between other combinations of sites. 

Is the RDF (gp3) present in the synaptic complexes, 
or does it act by remodelling the Int-DNA complexes and 
then dissociating? Previously, the Smith group had iso- 
lated an Int mutant that can synapse and recombine 
either attPxattB or attL x attR without gp3. Using this 
mutant, it was shown that the electrophoretic mobilities 
of both attP-attB and attL-attR synaptic complexes are 
shifted in the presence of gp3, suggesting that gp3 is in 
the complexes. Furthermore, a kinetic analysis of the 
excision reaction suggested a stoichiometric interaction 
of gp3 with Int. 

Figure 1 summarizes our understanding of the mecha- 
nisms of cf)C31 Int-mediated integration and excision, fol- 
lowing the work of Khaleel etal. Their results now provide 
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Fig. 1. Mechanism of <|)C31 integration and excision. Int subunits 
are represented by grey circles (for integration) or triangles (for 
excision), to indicate the proposed protein conformational changes. 
Note that the actual stoichiometry of the Int and RDF subunits 
remains unknown. Black and red lines represent the host and 
phage genomic DNA. 

us with all the pieces of the jigsaw needed to study the 
complete system in molecular detail. We still have a lot to 
learn about this fascinating family of enzymes, including 
the stoichiometry of the synaptic complexes and the 
nature of the proposed RDF-induced structural changes. 
And of course, we eagerly await crystallography! 
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